Rank in Wordlist | Frequency | Word |
---|---|---|
5750 | 344 | 한-미 |
6045 | 326 | 원-달러 |
8462 | 231 | 한-일 |
10192 | 190 | 러시아-우크라이나 |
11614 | 167 | 미-중 |
13563 | 141 | 2-0으로 |
14139 | 135 | 2-1로 |
14888 | 128 | F-35A |
15325 | 124 | 2-1 |
15680 | 121 | 1-0으로 |
17349 | 108 | 2-0 |
18449 | 102 | 화성-17형 |
19653 | 94 | B-1B |
23447 | 77 | 1-1로 |
23700 | 77 | 한-중 |
23864 | 76 | 북-미 |
24269 | 74 | 0-1로 |
26517 | 67 | FA-50 |
26840 | 66 | 1-2로 |
27164 | 65 | 0-0으로 |
Rank in Wordlist | Frequency | Word |
---|---|---|
39727 | 42 | 한-미-일 |
140619 | 8 | SARS-CoV-2 |
145768 | 8 | 북-중-러 |
181146 | 6 | 미-중-러 |
196820 | 5 | 4-2-3-1 |
196821 | 5 | 4-3-3 |
196823 | 5 | 4-4-2 |
198320 | 5 | Over-The-Air |
208477 | 5 | 밀양-의령-함안-창녕 |
231789 | 4 | Best-in-class |
Rank in Wordlist | Frequency | Word |
---|---|---|
196820 | 5 | 4-2-3-1 |
208477 | 5 | 밀양-의령-함안-창녕 |
233264 | 4 | ‘관심-주의-경계-심각’ |
281473 | 3 | 4-1-4-1 |
376571 | 2 | 6-3-3-4 |
419631 | 2 | 김준수-영탁-모태범-박태환의 |
509252 | 2 | 전현무-배성재-홍현희-김동현-김민아가 |
510340 | 2 | 정무-경제-시민사회-사회-홍보 |
516935 | 2 | 중국-난사/베트남-쯔엉사/필리핀-칼라얀 |
573349 | 1 | 16강-8강-4강-결승까지 |
Rank in Wordlist | Frequency | Word |
---|---|---|
509252 | 2 | 전현무-배성재-홍현희-김동현-김민아가 |
510340 | 2 | 정무-경제-시민사회-사회-홍보 |
661300 | 1 | A-B-C-D-E로 |
667089 | 1 | E-A-D-G-B-E로 |
667779 | 1 | Eb-Gb-Db-Gb-Bb-Db |
676310 | 1 | OB-태평양-삼성-쌍방울-LG-SK-한화 |
680122 | 1 | SU-35·SU-30·SU-27·MIG-29 |
704979 | 1 | ‘도-레-미-파-솔-라-시-도’의 |
714331 | 1 | ‘부정-분노-타협-우울-수용’을 |
841739 | 1 | 기능불량-균열-들뜸/탈락-결로-누수의 |
Some languages allow the formation of longer word by composition using hyphens. Moreover, proper names may contain hyphens. Therefore we look for the most frequent words containing 1, 2, 3 or 4 hyphens.
Usually we find interesting words. But in the case of poor preprocessing there may be unexpected strings resulting from hyphenation etc. Words ending with an hyphen are usually not welcome, too.
For three hyphens:
select w_id-100,freq, word from words where word like "%-%-%-%" limit 10;
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots
3.12.4 Words containing special characters